AtomS3R-M12 Volcengine Kit

SKU:D062-M12

Description

AtomS3R‑M12 Volcengine Kit is an IoT vision+voice development kit that deeply integrates M5Stack hardware with Volcengine’s cloud AIGC one-stop solution. It consists of two core parts: the high-performance image capture unit AtomS3R‑M12 and the AI voice processing base Atomic Echo Base. AtomS3R‑M12 provides 3 MP wide-angle video capture and edge computing capabilities, with expansion interfaces for various sensors. Atomic Echo Base integrates high-fidelity audio decoding, microphone, and speaker drivers, supporting full-duplex voice wake-up, recognition, and interaction. Volcengine RTC, in collaboration with M5Stack, offers a built-in one-stop solution that integrates advanced audio processing (including wake‑up and audio 3A) on the chip side, and deeply incorporates large models, speech recognition, speech synthesis, function calling, and knowledge-base technologies on the cloud side, quickly achieving smooth, natural, human-like real-time communication between users and hardware. It is widely applied in smart security, remote education, smart home, industrial monitoring, AI robotics, and other fields.

Product Features

Volcengine RTC real-time communication
AI visual recognition
AI voice recognition
Edge-to-cloud collaboration & model management
Integrated ESP32‑S3‑PICO‑1‑N8R8 SoC
3 MP OV3660 camera (120° FOV)
Nine‑axis sensor system
Edge AI inference
8 MB Flash & 8 MB PSRAM
Infrared emission control support
Expandable pins & interfaces
Full‑duplex I2S audio
24‑bit audio codec
MEMS digital microphone
Class D amplifier (8 Ω @ 1 W speaker)
Development platforms
- Arduino IDE
- ESP‑IDF
- PlatformIO

Includes

1 x AtomS3R‑M12
1 x Atomic Echo Base

Applications

Smart security
Remote education
Smart home
Industrial monitoring
AI tutoring
STEAM education

Specifications

Specification	Parameter
SoC	ESP32‑S3‑PICO‑1‑N8R8, dual‑core Xtensa LX7 @240 MHz, USB‑OTG
Storage	8 MB Flash + 8 MB PSRAM
Wireless	Wi‑Fi 2.4 GHz
Cloud Stream Processing	Volcengine Stream real‑time stream access
Cloud Recognition	Face detection, target tracking, OCR text recognition, ASR speech‑to‑text
Camera	OV3660, 3 MP, F2.4 aperture, 120° FOV, 30 FPS
Infrared IR	180° emission angle, up to 12.46 m without obstruction
Sensor System	Nine‑axis (BMI270 + BMM150)
Interfaces	USB‑C (power/UVC plug‑and‑play), HY2.0‑4P expansion
UVC	USB Video Class plug‑and‑play
Edge AI	ESP32‑S3 + TinyML: on‑device image detection, keyword wake‑up
Audio Codec	ES8311, 24‑bit I2S, 16 kHz–64 kHz
Microphone	MEMS digital microphone, SNR ≥ 65 dB
Amplifier	NS4150B Class D
Speaker	1 W @ 8 Ω
Communication Mode	I2S full‑duplex
Operating Temperature	0 ~ 40 °C
Product Dimensions	AtomS3R‑M12: 26.4 × 24.0 × 22.5 mm Atomic Echo Base: 26.4 × 24.0 × 22.5 mm
Product Weight	AtomS3R‑M12: 10.8 g Atomic Echo Base: 10.8 g

Learn

Download Mode

To flash firmware, press and hold the reset button (for about 2 seconds) until the internal green LED lights up, then release; the device will enter download mode and wait for flashing.

Schematics

1/3

PinMap

BMI270 & IR & RGB

ESP32-S3-PICO-1-N8R8	G0	G45	G47
LP5562 (RGB control chip)	SYS_SCL	SYS_SDA
BMI270	SYS_SCL	SYS_SDA
IR			IR_LED_DRV

BMM150

BMI270	BMI270_ASDx	BMI270_ASCx
BMM150	A_SDA	A_SCL

BMM150 mounted on BMI270

Access BMM150 via BMI270’s Sensor Hub auxiliary I2C interface for unified 9‑axis sensor data collection

OV3360 (M12)

OV3360 (M12)	ESP32-S3-PICO-1-N8R8
CAM_SDA	G12
CAM_SCL	G9
VSYNC	G10
HREF	G14
Y9	G13
XCLK	G21
Y8	G11
Y7	G17
PCLK	G40
Y6	G4
Y2	G3
Y5	G48
Y3	G42
Y4	G46
POWER_N	G18

Atomic Echo Base

Atomic Echo Base	SCL	SDA	SD/DSDIN	WS/LRCK	ASDOUT	SCK/SCLK
AtomS3R M12	G39	G38	G5	G6	G7	G8

HY2.0-4P

HY2.0-4P	Black	Red	Yellow	White
PORT.CUSTOM	GND	5V	G2	G1

Model Size

1/2